Representation of Arabic Words - An Approach Towards Probabilistic Root-Pattern Relationships
Author
Abstract
In traditional Arabic NLP, the root-pattern relationship has generally been treated as a simple relationship, while the possibility of treating it as a statistical measure has been largely neglected and never formally considered. This paper therefore explores some of the issues involved in treating the classical phenomenon of Arabic root-pattern relationships as probabilistic measures. It introduces some novel probabilistic measures for Arabic NLP and discusses their semantic potential as uncertain relations that capture root-related Arabic word forms probabilistically.
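As a rough illustration of what treating a root-pattern relationship as a probabilistic measure could mean, the sketch below estimates a conditional probability P(pattern | root) by relative frequency. This is an assumption made for illustration only: the abstract does not define the paper's measures, and the function name `pattern_given_root`, the transliterated roots and patterns, and the toy counts are all hypothetical.

```python
from collections import Counter, defaultdict


def pattern_given_root(pairs):
    """Estimate P(pattern | root) by relative frequency.

    `pairs` is a list of (root, pattern) tuples. This is an illustrative
    measure, not the one defined in the paper.
    """
    root_totals = Counter(root for root, _ in pairs)  # occurrences per root
    joint = Counter(pairs)                            # occurrences per (root, pattern)
    probs = defaultdict(dict)
    for (root, pattern), n in joint.items():
        probs[root][pattern] = n / root_totals[root]
    return dict(probs)


# Hypothetical toy observations (transliterated; not taken from any corpus).
observations = [
    ("ktb", "fa3ala"),   # e.g. kataba
    ("ktb", "fa3ala"),
    ("ktb", "faa3il"),   # e.g. kaatib
    ("ktb", "maf3uul"),  # e.g. maktuub
    ("drs", "fa3ala"),   # e.g. darasa
    ("drs", "faa3il"),   # e.g. daaris
]

probs = pattern_given_root(observations)
```

Under this reading, each root carries a distribution over patterns rather than a yes/no link to each pattern, so a rare or unattested root-pattern combination receives a low or zero score instead of being treated like a frequent one.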
Similar resources
Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore
Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...
Detection and Correction of Non-Words in Arabic: a Hybrid Approach
As Arabic is known for its highly inflectional morphological structure, this hybrid approach uses morphological knowledge in the form of consistent root-pattern relationships, together with some morpho-syntactic knowledge based on affixation and morphographemic rules, to specify the word recognition and non-word correction process. Furthermore, this paper proposes novel probabilistic measures for c...
Unsupervised Induction of Arabic Root and Pattern Lexicons using Machine Learning
We describe an approach to building a morphological analyser of Arabic by inducing a lexicon of root and pattern templates from an unannotated corpus. Using maximum entropy modelling, we capture orthographic features from surface words, and cluster the words based on the similarity of their possible roots or patterns. From these clusters, we extract root and pattern lexicons, which allows us to...
Towards a new Approach for Arabic root extraction: Exploit relations between the word letters and their placement in the word for Arabic root extraction
This paper presents a new root-extraction approach for Arabic words. The approach tries to assign a unique root to each Arabic word without relying on a database of word roots, a list of word patterns, or a list of all the prefixes and suffixes of Arabic words. Unlike most Arabic rule-based stemmers, it tries to predict the positions of the root letters one by one based on some rules and relati...
Agreement and Plural features in Heritage Arabic Speakers
Studies of heritage speakers of Spanish and Russian have reported that verbal and nominal morphology are vulnerable areas for L1 loss and incomplete acquisition. In this paper, we will report on on-going experimental research on the vulnerability of nominal and verbal agreement in Heritage Arabic speakers, since Arabic presents comple...